A NOTE ON CONDITIONED GALTON- WATSON TREES 

ROMAIN ABRAHAM AND JEAN-FRANgOIS DELMAS 

CO ' Abstract. We give a necessary and sufficient condition for ttie convergence in distribution 

, of a conditioned Galton- Watson tree to Kesten's tree. Tliis yields elementary proofs of 

>— ^ ■ Kesten's result as well as other known results on local limit of conditioned Galton- Watson 

trees. We then apply this condition to get new results, in the critical and sub-critical cases, 

Vh , on the limit in distribution of a Galton- Watson tree conditioned on having a large number 

•H^' of individuals with out-degree in a given set. 
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1. Introduction 

Pm I Galton- Watson (GW) process is a very simple model of population growth where all indi- 

viduals give birth independently from each others to a random number of children with the 
i-£^ , same reproduction law p. This population growth can be described by a genealogical tree r 

Cd ' that we call the GW tree. It is well-known that, in the sub-critical case (the mean number 

of children of a single individual is strictly less than 1) and in the critical case (the mean 
number of children of an individual is 1), the population becomes a.s. instinct. However, one 
can define in these two cases the tree r*, called Kesten's tree, conditioned on non-extinction 
^ . as the local limit in distribution of the tree r conditioned to reach height n, when n tends to 

'n I infinity, see Kesten [9]; this result is recalled here in Section [2.41 

It is also known that, at least in the critical case, other conditionings such as conditioning 
^ ! by the total progeny, see Kennedy [8] and Geiger and Kaufmann [3], or by the number of 

— u ■ leaves, see Kortchemski |10] or Myllari [12] for associated works, lead to the same local 

(^ . limit in distribution. See also the survey from J ANSON [6]. 

fO I For all those cases, the conditioning event can be written {r € A„} with A„ of the form: 

A„ = {t, A{t) > n} or A„ = {t, ^(t) = n}, 

where A : t >-^ A{t) is a functional defined of the set of trees and satisfying an additive 
property, see Equation ([9]). We prove our following main result, see Theorem 13. II for a precise 
^.' statement, which unifies all the previous conditioning and gives a necessary and sufficient 

condition to obtain Kesten's tree as a limit. In the non-degenerate critical case, if A satisfies 
the additive property Q with some aperiodic condition, then the following two statements 
are equivalent: 

• lim„^+ooP(r G A„) = and lim„_,+oo P('r G A„+i)/P(t G A„) = 1, 

• The distribution of r conditionally on {r G A„} converges to the distribution of 
Kesten's tree r*. 

Using this result, we give elementary proofs for the convergence in distribution to Kesten's 
tree r* of the GW tree conditioned on: 

(i) Extinction after or at a large time (sub-critical and critical case), with A(t) = H{t) 
the height of the tree t and conditioning event {H{t) = n} or {H{t) > n}. See 
Sections O and 221 
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(ii) Large total population size (critical case), with ^(t) = Card (t) the total size of the 

tree and conditioning event {Card (r) = n} or {Card (r) > n}. See Section l43l 
(iii) Large number of leaves (critical case), with ^(t) = -Lo(t) the total number of leaves 
of t and conditioning event {Lq{t) = n} or {Lo{t) > n}. See Section H7il 

In fact the conditioning on the large total population size or on the large number of leaves 
are particular cases of tree conditioned on large number of individuals with a given number 
of children. This corresponds to the functional A(t) = L^(t) which gives the total number of 
individuals of the tree t whose number of children belongs to a given set A. Such conditioning 
has already been studied by RizzOLO [H], see also MiMAMi [11], but for global scaling limit 
and not local limit. We obtain the convergence in distribution to Kesten's tree r* of the GW 
tree conditioned on: 

(iv) Large number of individuals with number of children in a given set A (critical case) , 
with A(t) = -L^(t) and conditioning event {Lj[(t) = n} or {Lj[{t) > n}. One has to 
distinguish according to G ^, Section 15.11 and ^ A, Section 15. 2[ 

Let us remark that the total progeny ^ = N, the number of leaves A = {0} and the number 
of internal nodes >4^ = N \ {0} are particular cases of this conditioning. 

We then study the subcritical case and define a one-parameter family {pe,d € /) of dis- 
tributions on the set of integers such that the GW tree r associated with the reproduction 
law p and the GW tree Tg associated with the reproduction law pg have the same conditional 
distributions given L_4, see Proposition 15.61 This generalizes Kennedy's transformation J8] 
concerning the total progeny, and the pruning of Abraham, Delmas and He [IJ concerning 
the number of leaves. We then immediately deduce, see Corollary 15.81 that if there exists 
^0 such that pg^ is critical, then the distribution of r conditionally on {Lj({t) = n} or on 
{L_4(r) > n} converges (in the aperiodic case) to the distribution of the Kesten's tree Tq 
associated with the reproduction law pg^. When there is no such ^O; then a condensation 
phenomenon may appear, see [Tj or f6j . 

Finally, we consider another conditioning which does not enter into the framework of 
Theorem 13.11 : conditioning on the size on the n-th generation. However, we can adapt the 
proof of Theorem 13.11 to get an analogous result in that case, see Proposition 16.11 We apply 
this result to a geometric reproduction law where explicit computations can be performed 
to prove that the corresponding tree conditioned on the n-th generation being on size n 
converges in distribution to Kesten's tree. 

The paper is organized as follows. In the Section [2l we recall the framework we use for 
discrete trees and define the GW tree r and Kesten's tree r* associated with a reproduction 
law p. In Section [31 we state and prove the necessary and sufficient condition for convergence 
in distribution of the conditioned tree to Kesten's tree. We apply this result in Section [H to 
recover the classical results on critical conditioned GW trees and we study in Section [5] the 
case of the number of individuals with out-degree in a given set for the critical and sub-critical 
case. Finally, we study in Section [6] the conditioning on the size of the n-th generation of the 
tree. 

2. Technical background on GW trees 

2.1. First notations. We denote by N = {0, 1,2,...} the set of non-negative integers and 
by N* = {1,2,...} the set of positive integers. 

If iC is a subset of N*, we call the span of K the greatest common divisor oi K. If X is an 
integer-valued random variable, we call the span of X the span of {n > 0, ¥{X = n) > 0} 
the restriction to N* of its support. 
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2.2. The set of discrete trees. We recall Neveu's formalism [13j for ordered rooted trees. 
We set 

U=\J (N*)" 

n>0 

the set of finite sequences of positive integers with the convention (N*)'^ = {0}. For u £ U 
let |n| be the length or generation of u defined as the integer n such that u G (N*)". If u and 
V are two sequences of U, we denote by uv the concatenation of the two sequences, with the 
convention that uv = u ii v = ^ and uv = v ii u = fl). The set of ancestors of u is the set: 

(1) Au = {v £ U; there exists w ^U, w ^ ^, such that u = vw}. 

The most recent common ancestor of a subset s oiU, denoted by MRCA(s), is the unique 
element u of HnGs^" with maximal length \u\. 
A tree t is a subset of hi that satisfies: 

• 0Gt, 

• If « € t, then A^ C t. 

• For every u € t, there exists a non-negative integer A;„(t) such that, for every positive 
integer i, tii € t iff 1 < i < ku{t). 

The integer A;„(t) represents the number of offsprings of the vertex n G t. The vertex n € t 
is called a leaf if /cu(t) = 0. The vertex is called the root of t. Let us remark that, for a 
tree t, we have 

(2) ^fc„ = Card(t)-l. 

uet 

Let t be a tree. The set of its leaves is Co{t) = {n G t; ku{t) = 0}, its height is defined by 

H{t) = sup{[u|, n € t} 

and can be infinite. For n G t, we define the sub-tree 5u(t) of t "above" u as: 

5„(t) = {v eU, uv e t}. 

Notice that by construction MRCA(5tj(t)) = u. 
We denote by T the set of trees, by 

To = {t G T; Card (t) < +00} 

the subset of finite trees, by 

T^'^) = {t eT;H{t) <h} 
the subset of trees with height at most /i G N, and by 

Ti = {tGT; lim |MRCA({n G t; |n| = n})| = +00} 

n— >+oo 

the subset of trees with a unique infinite spine. Notice that Tq and T*^^ are countable and 
Ti is uncountable. For /i G N the restriction function r/j from T to T is defined by: 

r/j(t) = {n G t, \u\ < h}. 

We endow the set T with the ultrametric distance 

d(t t') = 2~™^^'f^^^' '"h(t)=^'h (*')}_ 

A sequence (t„,n G N) of trees converges to a tree t with respect to the distance d if and 
only if, for every /i G N, 

^ft(tn) = ^/i(t) for n large enough. 
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The Borel u-field associated with the distance d is the smallest cr-field containing the single- 
tons for which the restrictions functions {vh, /i € N) are measurable. With this distance, the 
restriction functions are contractant. Since Tg is dense in T and (T, d) is complete, we get 
that (T, d) is a Polish metric space. 

Consider the closed ball B{t, 2'^) = {f G T; d(t, t') < 2-^} for some t G T and /i G N and 
notice that: 

i?(t,2-'^)=r,-i(K(t)}). 

Since the distance is ultrametric, the closed balls are open and the open balls are closed, 
and the intersection of two balls is either empty or one of them. We deduce that the family 
((r^ ({t}),t G T^ '),h G N) is a vr-system, and Theorem 2.3 in [2] implies that this family 
is convergence determining for the convergence in distribution. Let {Tn,n G N*) and T be 
T-valued random variables. We denote by dist{T) the distribution on the random variable 
T (which is uniquely determined by the sequence of distributions of rh{T) for every /i > 0), 
and we denote 

dist(Tn) — > dist(T) 

n— >+oo 

for the convergence in distribution on the sequence {Tn,n G N*) to T. 

We deduce from the Portmanteau theorem that the sequence {Tn,n G N*) converge in 
distribution to T if and only if for ah /i G N, t G T^^^: 

lim FirhiTn) = t) = P(r;,(r) = t). 

n— >+oo 

As we shall only consider Tg-valued random variables that converge in distribution to a Ti- 
valued random variable, we shall give an other characterization of convergence in distribution 
that holds for this restriction. To present this result, we introduce some notations. If t,s G T 
and X G Co{t) we denote by: 

t ® (s, x) = {n G t} U {xv, u G s} 

the tree obtained by grafting the tree s on the leaf x of the tree t. For every t G T and every 
X G >Co(t), we shall consider the set of trees obtained by grafting a tree on the leaf x of t: 

T(t,x) = {t®(s,x), sgT}. 

It is easy to see that T(t,x) is closed. It is also open, as for all s G T(t,x) we have that 
5(s,2-^W-i)cT(t,x). 

Moreover, notice that the set Ti is a Borel subset of the set T. 

Lemma 2.1. Let {Tn,n G N*) and T be T-valued random variables which belong a.s. to 
ToU'^^i- ^^6 sequence {Tn,n G N*) converges in distribution to T if and only if for every 
t G To and every x G Co(t), we have: 

(3) lim P(T„ G T(t, x)) = ¥{T G T(t, x)) and lim P(r„ = t) = ¥{T = t). 

n— >+oo n— >+oo 

Proof The subclass T = {T(t,x), t G Tq, x G >Co(t)} U {{t}, t G Tq} of the Borel sets on 
To U Ti forms a 7r-system since we have 



T(ti,Xi)nT(t2,X2) 



fT(ti,xi) iftiGT(t2,X2), 

ir(t2,X2) ift2GT(ti,Xi), 

{ti} if ti = t2 and xi ^ X2, 

in the other cases. 
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For every /i G N and every t S T^^' , we have that t' belongs to r^ ({t})P|Ti if and 
only if t' belongs to some T(s, x) where x is a leaf of t such that |x| = h and s belongs 
to r^ ({t})P|To such that x is also a leaf of s. Since Tq is countable, we deduce that J^ 
generates the Borel u-field on Tq U Ti. In particular J^ is a separating class. 

Since A (^ T is closed and open as well, according to Theorem 2.3 of [2], to prove that the 
family J^ is a convergence determining class, it is enough to check that for all t S Tq IJ I'l 
and /i € N, there exists A £ J^ such that: 

(4) t£AcB{t,2-''). 

If t G To, this is clear as {t} = B{t,2''') for all h > H{t). If t G Ti, for ah s G Tq and 
X G Co{s) such that t G T(s, x), we have t G T(s, x) C B{t,2~'^'). Since we can find such a 
s and X such that |x| is arbitrary large, we deduce that (JH is satisfied. This proves that the 
family J-" is a convergence determining class. 

Since, for t G Tq and x G Co{t) the sets T(t,x) and {t} are open and closed, we deduce 
from the Portmanteau Theorem that if (r„,n G N*) converges in distribution to T, then ([3]) 
holds for every t G Tq and every x G Co{t). D 

2.3. GW trees. Let p = {p{n),n G N) be a probability distribution on the set of the non- 
negative integers. We assume that 

+ 00 

(5) p(0) > 0, p(0) +p(l) < 1, and m := y^np{n) < +oo. 

n=0 

A T-valued random variable r is a Galton- Watson (GW) tree with reproduction law p if 
the distribution of A:0(t) is p and for n G N*, conditionally on {A;0(r) = n}, the sub-trees 
(5i(r),52(r), . . . ,5„(t)) are independent and distributed as the original tree r. Equivalently, 
for every /i G N* and every t G T*^ -*, we have 

F{rh{T)=t)= H p{K{t)). 

In particular, the restriction of the distribution of r on the set Tq is given by: 

(6) VtGTo, F{T = t) = Ylp{K{t)). 

wet 

The GW tree is called critical (resp. sub-critical, super-critical) if m = 1 (resp. m < 1, 
m > 1). 

2.4. Conditioning on non-extinction. Let p be a reproduction law satisfying Assumption 
([5]) with m < 1 (i.e. the associated Galton- Watson process is critical or sub-critical). We 
denote by p* = {p*{n) = np{n)/m,n G N) the corresponding size-biased distribution. 

According to Kesten [9], we can define an infinite random tree r* which can be viewed as 
the tree r conditioned on non-extinction in the following sense: 

V/i G N*, Vt G T^"), P(r/,(T*) = t) = lim P(r,,(r) = t I H{t) > n). 



-oo 



The distribution of r* is as follows. Almost surely, r* belongs to Ti, that is there exists 
a unique infinite sequence {Vk,k G N*) of positive integers such that, for every /i G N, 
Vi- ■ -Vh G T* , with the convention that Vi ■ ■ ■ Vh = $ ii h = 0. The joint distribution of 
{Vk,k G N*) and r* is determined recursively as follows: for each /i G N, conditionally given 
(l/i, . . . , Vh) and Vhir*), we have: 
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• The number of children (^^(t*), w € r*, \v\ = h) are independent and distributed 
according to p if v ^ Vi- • -Vh and according to p* ii v = Vi ■ ■ -Vh- 

• Given also the numbers of children (ky^r*), u G r*, \v\ = h), the integer V/^+i is 
uniformly distributed on the set of integers {!,..., kvi-Vhi'^*)}- 

As a direct consequence we get that for all /i G N, t E T^ ' , u Gt such that \u\ = h: 

nrh{r*) =t,Vi---Vh = u) = m-^nrh{r) = t), 
and for all t € Tq, x € /3o(t): 

(7) P(r* E T(t, x)) = m-l=''lp(r G T(t, x)). 

Since, for t G Tq and x G £o(t), P(t = t) = P(t G T(t,2;), A:^(t) = 0) = P(t G T(t,x))p(0), 
we deduce that: 

(8) P(t* G T(t, x)) = — ^ P(r = t). 

3. Main result 

Let A be an integer-valued function defined on T satisfying the following additivity prop- 
erty: there exists an integer-valued function D defined on T such that, for every t G Tq, every 
X G >Co(t) and for every t such that A{t ® (t,a;)) is large enough, 

(9) A{t®{t,x)) = A{t) + D{t,x). 

Let no G N U {+00} be given. We define for all n G N*, the subset of trees 

A„ = {t GT;^(t) G [n,n + no)}. 

Typical values of no are 1 and -|-oo. 

The following theorem states that the distribution of the GW tree r conditioned to be 
in Aoo, the limit of A„, is distributed as r* as soon as the probability of A„ satisfies some 
regularity. We denote by 

dist(r|r G A„) 

the conditional law of r given {r G A„}. 

Theorem 3.1. Assume that Assumptions ^B^ and (0j hold as well as one of the two following 
conditions 

• m = 1 or 

• m <1 and D(t,x) = \x\ for all t G Tq, x G Co{t). 
Then, if 

(10) lim P(tgA„) = and lim ^i^l^!i±il = m, 

n-i>+oo n^+00 P(t G A„) 



we have: 



dist{T\T G A„) — > dist{T*), 

n— >+oo 



that is: 

(11) V/iGN*, Vt gT(^), lim P(r/,(r) = t I r G A„) = P(r,,(r*) = t). 

Conversely, if dist{T\T G A„) — > dist{T*) that is ill]) holds and if the span of {D{t,x);t G 

n— )-+oo 

Tq and X G /:o(t)}ni^* is one, then ^^ holds. 
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Proof. Let us first remark that, as we supposed that tti < 1, we have a.s. r € Tq and thus 
we are in the setting of Lemma 12. 1[ 

Using Q, we have for every t ^ Tq, x £ Coit) and t € Tq: 

P(t = t ® (t, x)) = ^ P(r = t)P(r = t). 

Let t G To and x € Co{t). We deduced that for n large enough, so that we can apply Equation 
©: 

teTo 

= ^^ ^ ^^'^ " *)^*^^ " *)l{n<A(t)+D(t,x)<n+no} 

teTo 

■~V = t)P(n - ^(t, x) < yl(T) < n + no - D(t, x)) 



p(0) 

= ml-lp(T* e T(t,x))P(r G A„_^(t,,)), 
where we used ([8]) for the last equality. Therefore we have 

(12) P(r G T(t,x) I r G A„) = P(r* G T(t,x))ml-I \^^ ^ ^^''^ ■ 

Then, using pU|) and that 15(1, x) = |x| if tti < 1, we obtain that: 

(13) lim P(r G T(t, x) I r G A„) = P(t* G T(t, x)). 

As lim„^+ooP(r G A„) = 0, we also deduce that for all t G Tq, P(t = t,T G A„) = for n 
large enough and thus: 

(14) lim P(r = t I T G A„) = = P(t* = t). 

We deduce from Lemma |2. II that (jlip holds. 

Conversely, if ([U]) holds, then Lemma [27T] implies that ([I3D and ([H]) hold. Then (fTi]) 
readily implies that the first part of (|10p holds. The fact that the span of {D(t,x);t G 
To and x G £o(t)} f] ^* i^ ^^^ ^^'^ ()12p imply, with Bezout theorem, that the second part of 
(fTUj) holds. D 

4. Examples 

4.1. Conditioning on extinction after large time. We give here a simple proof of 
Kesten's result for the convergence in distribution of a critical or sub-critical GW tree con- 
ditioned on non-extinction, see [10] . 



Proposition 4.1. Letr be a critical or sub- critical GW tree with reproduction lawp satisfying 
Assumption ^. Then, we have 

(15) dist{T\H{T) >n) — > dist{T*). 



n— >+oo 



Proof. Consider A[t) = H{t) and no = +oo that is A„ = {t G T; H{t) > n}. Notice that in 
this case for a tree t such that H(t) is larger than H{t), we have for every x G >Co(t) 

(16) ^(t®(t,x)) =^(t) + |x|. 

Therefore, Condition ([9]) is satisfied by A. 
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According to Theorem 13. 11 it suffices to prove 

(17) lim ¥(H(t) > n) = 0, lim -^-= — --^ = m 

to get (jlSp . As we are in the critical or sub-critical case, H{t) is finite a.s. and therefore, the 
first part of p!7|) holds. 

We denote by if the generating function of p and we define recursively ipi = ip and for 
n > 1, (fn+i = ^n°^- As ipn is the generating function of the distribution of {u G r; \u\ = n} 
the number of individuals at height n, we have P(t G A„) = 1 — (/9„(0). We also have 
hmn^+ooV'nlO) = 1 and 

P(t € A„+i) 1 - JfiMO)) /.1^ 

lim — — — ^ = lim — = if (I) = m 

n^+oo P(tGA„) n^+oo l-99„(0) ^^^ 

which is the second part of (J17p . D 

4.2. Conditioning on extinction at large time. 

Proposition 4.2. Letr be a critical or sub- critical GW tree with reproduction law p satisfying 
Assumption ^. Then we have 

(18) dist{T\H{T) =n) — > dist{T*). 

Proof. We consider A{t) = H(t) with uq = 1 that is A„ = {t € T; H{t) = n}. Since ^ is 
in force, we get that Condition ([9|) still holds. Again it suffices to prove 

Pfr = n-\- I) 

(19) lim ¥(H(t) = n) = 0, lim -— —^ = m 

^ ' n^+oo ^ ^ ^ ' n^+oo P(t = n) 

to get p8|) . Since r is a.s. finite, the first part of (|19p is immediate. 

Recall notation c/?„ introduced in Section l4.ll and that lim„_>.+oo '/'n.(O) = 1- We have 
P(t G A„) = (/Jn+i(0) - 93„(0) and: 

hm ^-^^A^= hm ^-^"(°^ , , \:,f' = "^ - "^ = m, 

n^+oo P(t e An) n^+oo ^ - ^^y('/^"(0)) 1 - m 

l-</3„(0) 



which is the second part of ([191) . D 

4.3. Conditioning on the total population size, critical case. We recover here results 
from [8] (under the hypothesis that Var (X) is finite) or Theorem 7.1 in [6] (under no condition 
on the variance of X) on the convergence in distribution of a critical Galton- Watson tree 
conditioned on the size of its total progeny to Kesten's tree. 

Proposition 4.3. LetT be a critical GW tree with reproduction law p satisfying Assumption 
^. Let d be the span of Card (r) — 1 (that is the span of the set {k > 0, p{k) > 0}). Then 
we have 

(20) dist{T\C&id (t) = nd + 1) — > distij*). and 

and 

(21) dist(r|Card (r) > n) — > dist{T*). 
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Remark 4.4. If we consider ^(t) = Card (t) and uq = +cx3 that is A„ = {t € T, Card (t) > 
n}, it is easy to adapt the end of the proof of Theorem 13.11 to get 

(22) hm IP(Card(r)>n + l) _^ 

^ ^ n^+oo P(Card (r) > n) 

Indeed, notice that the span of {D{t,x);t G Tq and x G /:o(t)} fll^*! with D{t,x) = A{t) 
given in (p3|) . is d. Then use that A^^+i = • • • = A^^+^+i to prove that ([2T]) imphes (j22]) . 
Notice it is not clear how we can obtain (|22p directly. 



Proof of Proposition \4-3[ Consider A{t) = Card (t) and no = d. Then we have 

A„ = {t G T; Card (t) G [n, n + d)}. 
We have for every t G T, without any additional assumption, 

(23) A{t®{i,x)) = A{i) + A{t), 
so Condition ([9|) holds. Again, it therefore suffices to prove 

(24) li,„ nCM (r) e |„, n + d)) = 0, li,„ "'^p//' ' '" V' " t ^t '" = ' 

n->+oo n->+oo lP(Card (r) G [n, n + dj) 

to get (PU|) . The first part of ([M|) is a direct consequence of Card (r) < +00 a.s. 

Before proving the second part of (fM|l . we first recall Dwass formula (see [3]). Let (r^, A; G 
N*) be independent GW trees distributed as r. Set Wk = Card (ta;). Let {Xk,k G N*) be 
independent integer-valued random variables distributed according to p. For A; G N* and 
n > k, we have: 

(25) P(Tyi + ... + Wk = n) = -P(Xi + . . . + X„ = n - A:). 

n 

We also recall some results on random walks. Let Y be an integrable random variable 
taking values in Z, such that E.[Y] = 0, ¥{Y = 0) < 1 and the span of |y| is 1. We consider 
the random walk S = {Sn,n G N) defined by: 

n 

(26) So = and Sn = ^Yk for n G N*. 

fc=i 

Then the randoin walk S is recurrent. If S is aperiodic, the strong ratio theorem for recurrent 
aperiodic random walks, see Theorem Tl p49 of \TE], gives that, for i £ Z: 

(27) lim '•'f--^; = Urn ^f ' ^ °> = 1. 

If S has period d, then for all A; G {1, . . . , d}, there exist jk G Z and n^ G N* such that 

(28) yn>nk, ^S^d+k = jk) > 0. 

The strong ratio theorem can then easily be adapted (using arguments similar to those used 
in the proof of Lemma 1531) to get that, for ^ G Z, A: G {1, . . . , d}: 

»"+~ P(S,^ = 0) 

By the definition of d, a.s. we have A{t) G (iN + 1. We consider an integer valued random 
variable X distributed according to p and we set y = X — 1 so that E[y] = since we 
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supposed that m = 1. The random walk defined by ()26p has period d and we can choose 
j'l = —1 in (j28p as ¥{Y = —1) > 0. Dwass formula (]25p implies that, for m = [(n — l)/d\: 

P(r G A„) = P(^(r) € [n,n + d)) = P(^(r) = md+ 1) = — L_^ P(S„rf+i = -1). 

md + 1 

Using (f29]l . we deduce that: 

^.^ P(r e A„+i) ^ ^.^ P(g(^+i)^+i = -1) ^ ^ 



-oo 



which implies readily the second part of ()24p . 

Let's turn now to the proof of ()2ip . As (iist(r|r € A„) is a mixture of (iist(T|Card (r) = fc) 
for k > n, we deduce from the first part of the proposition that (I2ip holds. D 



Remark 4.5. Notice that the local limit theorem gives asymptotics for P(S'„ = —1) when the 
distribution of X belongs to the domain of attraction of a stable law, see Theorem 4.2.1 of 
[5j or Theorem 1.10 in |10] . This gives asymptotics for P(r € A„) which in turns allow to 
recover Condition (IIOD. 



4.4. Conditioning on the number of leaves, critical case. For a finite tree t E Tq, we 
denote by -Lo(t) = Card {Co{t)) the number of leaves of t. The next proposition (which 
seems to be a new result) is in fact a particular case of the proposition of the next section. 
However, we prove it separately for pedagogical purpose as its proof and in particular the 
construction of the Galton- Watson tree that codes >Co(t) of Remark 14.81 are much simpler in 
that particular case. 

Proposition 4.6. Let t be a critical GW tree with reproduction law p satisfying Assumption 
1^. Let do be the span of the random variable Lq{t) — 1. Then we have 

(30) dist(r|Lo(r) = ndo + 1) — > dist{T*). 

and 

(31) dist{T\Lo{T) >n) — > distij*). 

Proof. We consider ^(t) = Lo(t) and no = do which yields A„ = {t G T; Lo(t) G [n, n + do)}- 
We have for every trees t, t G To and every x G >Co(t) 

(32) A(t®(t,x)) =^(t)+.4(t)-l. 

According to [11], see also Remark 14.81 below. Lq{t) is distributed as the total size of a 
critical GW tree tq with reproduction law given by the distribution of: 



N~l 



(33) Xo=yZ, 






k, 



k=l 
with {Zk,k G N*) and N independent random variables such that {Zk,k G N*) are inde- 
pendent and distributed as X — 1 conditionally on {X > 1} (where X is a random variable 
distributed according to p) and A^ has a geometric distribution with parameter p{0). As 
E[A'o] = 1, we get that tq is critical. Notice that do is also the span of the random variable 
Xq and that, if the span of X is d, then do = max(l, d — 1). 

We deduce from Subsection 14.31 that Condition (f2l]l is satisfied and thus: 

f'iA\ V wr^^^r ^^^^ n r F(-^o(r) G [n + l,n + 1 + dp)) 

34 Inn ¥{Lo{t) £ [n,n + do)) = 0, lim — — = 1. 

n^+oo n->+oo P(Lo(r) G [n, n + do)) 
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Then use Theorem 13.11 to get that (j30p holds. 
If we consider no = +00 that is: 

A„ = {t € To; Lo(t) > n}, 

arguing as in the proof of the second part of Proposition 14.31 we get (f3T]l . 

Remark 4.7. We deduce from Remark 14.41 that ([3T]) imphes 

P(Lo(r)>n+l) _^ 



D 



hm 

n— >+oo 



P(Lo(r) > n) 



Remark 4.8. We shah briefly recall how one can prove that Lq(t) is distributed as the total 
size of a Galton- Watson process by mapping the set of leaves Co{t) onto a GW tree, see [IT]. 
Let t be a tree. For n G t, we define the left branch starting from u as: 



Bl{u) = {uv; \v\ > I and t> = {1}I"I} n t. 

We also define the left leaf G{u) of u and the left ancestors Ag{v) of a leaf v as: 

Gt(u) = 5*('u) n £0(1) and ^*(?;) = {n € A.,; G*(n) = u}. 

For a leaf u € jCo(t), we define its leaf-children as: 

C\v) = {G\m); u € A^v), 1< i < fe„(t)}, 

labeled according to the following order: G^{ui) < G^{u'i') li u < u' in the lexicographic 
order or if u = u' and i < i' . This defines a tree, obtained from the leaves of t, denoted by 
t{o} = F{o}i'^)- Arid we have Card (t^o}) = Lo{t). 

If r is a GW tree then tjo} = -F{o} (t) is also a GW tree with reproduction law given by 
the distribution of Xq in (1351) . 





Figure 1. A tree t on the left and the coding of Co{t) by a tree to = F{t) 
tree on the right. 



5. Conditioning on the number of individuals having a given number of 

children 

Let ^ be a non-empty subset of N. For a tree t € T, we write Cj[{t) = {u £ t; A;„(t) G A} 
the set individuals whose number of children belongs to A and L_^{t) = Card (/3y^(t)) its 
cardinal. The case A = {0} represents the set of leaves of t and has been treated in Section 
4.41 We can also have L_/[{t) = Card (t) by taking ^ = N or L_4 can also be the number of 
internal nodes by taking ^ = N*. 
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Let US first remark that for every t G Tg, every x G Co(t) and every t € T 

-^^ ^ • " \Lj(t) + L^(t) ilOiA, 

and lience L^^ satisfies the additive property ([9]) with D{t,x) = Lj\,{t) — l{QeA}- 

Theorem 5.1. Let t be a critical GW tree with reproduction law p satisfying Assumption 
^. Let d_A be the span of the random variable L_a{t) — 1. Then we have 

(35) dist(r|L^(r) = nd^ + 1) — > dist{T*). 

and 

(36) dist{T\LA{T) > n) — > dist{T*). 

Remark 5.2. U ^ A, then d_A = 1. 

Remark 5.3. Arguing as in Remark 14.41 we get 

(37) lim nMllSn + l)=i. 

^ ^ n^+oo F{La{t) > n) 

In what follows, we denote by X a random variable distributed according to p. To avoid 
triviality, we assume that < ¥{X G ^) < 1. The proofs differ if € ^ or not, the next 
two subsections are devoted each to one case. Then, we consider the same problem for the 
sub-critical case, m = K[X] < 1, in a third subsection. 

5.1. Proof of Theorem 15.11 when G ^. We consider only F{X G ^) < 1, as the case 
¥{X G .4) = 1 corresponds to the critical case with .A = N of Section 14.31 In this case, 
extending the decomposition of [H], it is easy to see that -^^(r) is distributed as the total 
size of a critical GW tree r^ whose reproduction law is given by the distribution of: 

Af-l 

(38) XA=Y,yk + Y", 

k=l 

where {Y^,k G N*), A^ and Y" are independent random variables such that {Y^,k G N*) 
are independent and distributed as X — 1 conditionally on {X ^ A}, N is geometric with 
parameter ¥{X G A) and Y" is distributed as X conditionally on {X G A}. See Remark 15.41 
below for a construction of the corresponding GW tree. 

As IE[X_4] = 1, we get that tj_ is critical. Remark that (i_4 is also the span of Xyi_. Consider 
^0 = dyi_ which gives 

A„ = {tGT; L_A{t)e[n,n + dA)}. 

As L_a{t) is distributed as the total size of a critical GW tree, we deduce from Subsection 
Sathat 

(39) limP(Mr),[„,„ + d^)) = 0. lim '•'^l!;' " '" V'" 1 1^ V^" ^ ' 

n-^+oo p(L^(r) G [n,n + d^)) 

and thus by Theorem 13.11 that ()35p holds. D 

Remark 5.4. We shall briefly indicate how one can prove that Lj[{t) is distributed as the 
total size of a Galton- Watson process by mapping the set C_a{t) onto a GW tree, generalizing 
the construction of Remark 14.81 see also [11] for A = [0, n]. 
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We use notations from Remark 14. 8i For a tree t, n E t and v € Cji^{t), we define the left 
descendant of u in >C^(t) and the left ancestors A^g{v) of v as: 

G\{u) = argmin {\w\] w G -B*(n) n £^(t)} and ^^^^(f ) = {u e A^; G\{u) = v}. 

Remark that G^(ti) exists since we supposed that £ A. 
For V € >Cyt(t), we define its children as: 

C* (v) = /^^^(''')' ^ ^ 4^(^)' 1< ^ ^ ^«} if ^ e C{t), 

■^^ ' \{G\{uiy,ueAl^iv),l<i<ku}\J{vi^<i<k,{t)}, ifv^Cit), 

labeled according to the following order: G^(uz) < G^(t('z') if u < u' in the lexicographic 
order or if li = u' and i < i'; G^{ui) < vi' for u € A^ _^{v) and 1 < i' < A;„(t); and wi < vi' 
for 1 < i < i' < A;^(t). This defines a tree, obtained from Cj((t), denoted by t^ = F^{t). 
And we have Card (to) = L^{t). 

If T is a GW tree then t_4 = Fj({t) is also a GW tree with reproduction law given by the 
distribution of X^ in p8]) . 



5.2. Proof of Theorem 15.11 v^^hen ^ A. Notice that in that case, F{X e ^) < 1. For a 
tree t and u € 'C^(t) we define 5^(t) the sub-tree rooted at u with no progeny in A by: 

We set p{A) = P(X ^ A). We define p(0) = p(0) + Efce^P(^) ^nd, for A: G N*, p(/i:) = 
p(/i;)lr^0_4}. Let X be a random variable with distribution p, so that E[X] = E[Xlrj^g^c-i] < 
1. Notice that S^{t) is a GW tree with reproduction law p. We denote by L be the number 
of leaves of S^{t). Elementary computations yields: 

(40) m- **"* - "W+^'i-^) 



1 - E[l] E [X1|X6^)] 



where we used that E[X] = 1 for the last equality. 
We defined the children of n in yi as: 



G\u)=Co{S^{t))r^CA{t). 

The number of children in ^ of the root is then A'^g := Card (C(0)). Notice that conditionally 
given L, N^ has a binomial distribution with parameter {L,p(A)/p{0)). Let: 

z> 

(41) X^ = 5^iVfc, 

A;=l 

where Z' is a random variable distributed as X conditionally on \X G A\^ and (A^^, /c G N*) 
are independent random variables distributed as N^ and independent of Z . Notice that for 
any u € ^a{t) with u ^ ^, its number of children in A, G{u), is distributed as X^- 

Let t' be a GW tree with reproduction law given by the law of X^ and initial size 1. Let 
{Wlik G N*) be independent random variables distributed as Card (r') and independent of 

Nfj). We deduce that Ca{t) is distributed as X]fc=i ^fc- Notice that C-a{t) is empty if TVg = 0. 
By construction, notice the span of X^ is 1. The proof of the next lemma is postponed to 
the end of the section. 
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Lemma 5.5. We have: 

?i->+oo r[W{ = n) 
Then, using Subsection 14. 3[ we have hm„_>._|_oo p(-^y/l~?x) ~ ^^ ^^^^ ihMs we deduce that: 

(«) Um n^^-^W = "),!. 

^ ' n->+oo P(L^(r) = n + 1) 

As the span of X_4 is 1, we consider ng = 1 that is: 

A„ = {t € T; L^(t) = n}. 



Since r is a.s. finite, the first part of Condition (j39p is satisfied; and (|43p readily imphes the 
second part of Condition (|39l) . As usual, we deduce that (|35]) holds. D 



Proof of Lemma \5.5[ We have, using (j40p : 

E[X^] = E[Z']E[iVfc] 



^ -lEiXlixe^jlEiZ]- ^^-^^ 



= 1. 

Let S'n = Ylk=i(-^'k ~ 1)' where (X^, fc € N*) are independent random variables distributed 
as X^. By construction and using Dwass formula (|25p . we have: 

P(L^(r) = n) = 5^P(iV0 = fc)P K^Ty; = n = -J2k¥{N<^ = k)F{S'^ = -k). 

k=l \j=l J fc=l 

We set: 

X (t.\ _ ^i^n = -k) 

"^^ " ns'n = -1) 

The random walk {S'^^n G N) is recurrent and aperiodic as the span of X_4 is 1. We deduce 
from the strong ratio theorem (j27p that: 

(44) lim 5n{k) = 1. 

n-^+(X) 

Using Fatou's Lemma, we get: 

liminf ^^^^^"^^ = ^^ > Y, km^ = k) liminf 5n{k) = m^]. 

As W[ is the total size of a Galton- Watson process with reproduction law given by X_4, 
we have that W[ has the same distribution as: 

1 + Y.^i 

k=l 
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Using Dwass formula ()25p . we have: 



n,.;.„.„_g^^^^^^.;(E-..^i^.. 



+00 



fc=i 
Let mo > 1 such that F(Z' = rn-o) > 0. We deduce from ([3T|) that: 

{iVi = A;,iV2 = 0, . . . ,iV^o = 0, Z' = mo} C {X^ = k}, 
and thus, with C^^ = P(iV0 = 0)""o-^P(Z' = mo) > we have for ah /c G N: 

P(iV0 = A;) < CP(X^ = A:). 
We get for any K eW: 

l^i^P^ = f2kF{N^ = k)6nik)+ Y. kF{N, = k)6n{k) 

^ 1 ' k=l k=K+l 

K +00 

<Y,knN^ = k)5n{k) + C Y, k¥{Xj^ = k)5n{k) 

k=l k=K+l 

K K 

= J2 km, = k)5n{k) -cY mxA = k)6nik) + c^^l^^^. 

k=l k=l ^ 1 ^ 

Using (jM]), we get: 

umsup ^^^;;^;^_=^^ <f;fcp(iV0 = k) -cYmxA = k)+c. 

Letting K goes to infinity and using that E[X_4] = 1, we get: 

P(L 4(r) =n) 

This imphes (02]). D 

5.3. The sub-critical case. Let p be a reproduction law. For every > such that 
Z^feeN (^^p{k) is finite, we define on N the function po by 

' CA{0)0''p{k) ifA;G^, 
^p{k) [fk^A 

where the normalizing constant c^(^) is given by: 

... ^-Ek^A(^'-'pik) 

"^^^ ~ EkeAO'pik) ■ 
We denote by / the set of 9 such that pg defines a probability distribution on N. Notice 
that / contains (0, 1]. 

Proposition 5.6. Let t be a GW tree with reproduction law p satisfying p{0) > and 
p{0) +p(l) < 1. For every 6 £ I, let tq he a GW tree with reproduction law pg. Then the 
conditional distributions of t given {Ca{t) = n\ and of tq given {CAiTe) = n} are the same. 



Vfc > 0, pe{k) - . 
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Remark 5.7. This proposition covers Kennedy's result [8] for ^ = N and the pruning proce- 
dure of [U for ^ = {0}. 

Proof. Let t € Tq. Then we have, using the definition of pg and ([2]): 

nre = t) = llpe{kM} 
vet 

= n cA{9)e''^^'^p{k,{t)) n o'^^('^''p{k,{t)) 

v£t,ky(t)eA vet,ky{t)<^A 

We deduce that 

nLAire) = n)= ^ ¥{Te = t) 

teTo, LA{t)=n 

= ri(0c^(0)rP(L^(r)=n) 
and finally, for every t € Tq such that L_A{t) = n, we have 



nLAire) = n) 

9-HecA{e)mr 



(r = t I Lj({t) = n). 



e-^iecAie))^F{LAiTe) = n) 

n 

Corollary 5.8. Let t be a sub-critical GW tree with reproduction law p satisfying Assumption 
1^. For every 9 £ I, let tq be a GW tree with reproduction law pg. If there exists 6q £ I such 
that pfff^ is critical, then 

(iist(r|L_4(r) = ndj( + 1) — > dist{Tg^). 

and 

dist(T\L_A{T) > n) — > dist{Tg^). 

6. Conditioning by the size of a high generation 

We end this paper with a conditioning which does not enter into the framework of Theorem 
13. li However its proof can be easily adapted. For a tree t, we denote by 

C/„(t) = Card {{u € t, |m| = n}) 

the size of the n-th generation of t. Then we have 

Proposition 6.1. Let t be a critical GW tree with reproduction law p satisfying Assumption 
(5^. Let {an,n E N) be a sequence of positive integers. If 

(45) Inn nB„Mr) = -'.) ^ , 

"-+~ P(e„(T) = a„) 
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then we have 

(46) dist{T\gnir) = an) — > dist{T*). 

Proof. For every tree t G Tq, every x G Co(t) and every tree t € T, we have 

gnit®it,x)) = gnit) + gn-\4i) 

which generahzes Assumption ([9]). 

The same computations as in the proof of Theorem 13.11 give for t € Tq, x G >Co(t) and 
n > H{t): 

P(t E T(t, x), ^„(r) = an) = -^^(^ = t)IP(^n-|x|(T) = a„ - ^^(t)) 

= P(r*eT(t,x))P(e„_|,|(T) = an). 
Therefore, we obtain by Assumption ()45p : 

Hrn P(rGT(t,x)|^„(T) = a„)= Urn P(r* G T(t,x))^^^^^^;^^^^ 

= P(t* eT(t,x)). 

The result fohows from Lemma |2.1[ D 

Corollary 6.2. Let t be a critical GW tree with reproduction law p given by a mixture of a 
geometric distribution with parameter p G (0, 1] and a Dirac mass at 0, i.e. p{0) = 1 — p and 
p{k) = /)^(1 — p)^^^ for k > 1. Let (a„,n G N) be a sequence of positive integers such that 
liuin^+oo n~'^ an = 0. Then we have: 

dzst(r|t?„(T) = On) — > dist (r*). 

n— >+oo 

Proof. In that particular case, the generating function ipn of ^n(''") is explicitely known and 
we have for every s € [0, 1] 

nc— (nc — l)s 

fn{s) = 7 — T 

[nc + Ij — ncs 
with c = (1 — p)/p. Expending (pn gives for every k > 1: 

mM = k) = J=^, 

and therefore 

j.^^ Fjgn+ijr) = an) _ ^.^^ n(nc+l) f 1 + ^ ^ _^ 



n^+^ F{gn{T) = an) n^+^ {n + l)((n + l)c + 1) \ 1 + 7;^ 



(n+l)c , 



D 



Remark 6.3. As for Theorem 13. H we can obtain the converse of Proposition 16.11 We deduce 
that, in the geometric case, the Galton- Watson tree r conditioned on {^n(''") = iL"-"]}) with 
j G N*, converges in distribution to Kesten's tree if and only if a G [0, 2). 
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